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Abstract 

One of the simplest ways to decide whether a given finite sequence of 
positive integers can arise as the degree sequence of a simple graph is the 
greedy algorithm of Havel and Hakimi. This note extends their approach 
to directed graphs. It also studies cases of some simple forbidden edge-sets. 
Finally, it proves a result which is useful to design an MCMC algorithm 
to find random realizations of prescribed directed degree sequences. 
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1 Introduction 

The systematic study of graphs (or more precisely the tlinear graphs, as it was 
called in that time) began sometimes in the late forties, through seminal works 
by P. Erdos, P. Turan, W.T. Tutte, and others. One problem which received 
considerable attention was the existence of certain subgraphs of a given graph. 
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For example such subgraph could be a perfect matching in a (not necessarily 
bipartite) graph, or a Hamiltonian cycle through all vertices, etc. Generally 
these substructures are called factors. The first couple of important results of 
this kind are due to W.T. Tutte who gave necessary and sufficient conditions 
for the existence of 1-factors and /-factors. 

In the case of complete graphs, the existence problem of such factors is 
considerably easier. In particular, the existence problem of (sometimes sim- 
ple) undirected graphs with given degree sequences even admits simple greedy 
algorithms for its solution. 

Subsequently, the theory was extended for factor problems of directed graphs 
as well, but the greedy type algorithm mentioned above, to the best knowledge 
of the authors, is missing even today. 

In this paper we fill this gap: after giving a short and comprehensive (but 
definitely not exhausting) history of the /-factor problem (Section [2]) , we de- 
scribe a greedy algorithm to decide the existence of a directed simple graph 
possessing the prescribed degree sequence (Section In Section 2] we prove a 
consequence of the previous existence theorem, which is a necessary ingredient 
for the construction of edge-swap based Markov Chain Monte Carlo (MCMC) 
methods to sample directed graphs with prescribed degree sequence. Finally 
in Section [5] we discuss a slightly harder existence problem of directed graphs 
with prescribed degree sequences where some vertex-pairs are excluded from the 
constructions. This result can help to efficiently generate all possible directed 
graphs with a given degree sequence. 

2 A brief history (of /-factors) 

For a given function / : V{G) — ?> N U {0}, an f -factor of a given simple graph 
G{V,E) is a subgraph H such that dniv) — f{v) for all v G V. One of the 
very first key results of modern graph theory is due to W.T. Tutte: in 1947 he 
gave a complete characterization of simple graphs with an /-factor in case of 
/ = 1 (Tutte's 1-factor theorem, [II]). Tutte later solved the problem of the 
existence of /-factors for general /'s (Tutte's /-factor theorem, [H]). In 1954 
he also found a beautiful graph transformation to handle /-factor problems via 
perfect matchings in bipartite graphs [16]. This also gave a clearly polynomial 
time algorithm for finding /-factors. 

In cases where G is a complete graph, the /-factor problem becomes easier: 
then we are simply interested in the existence of a graph with a given degree 
sequence (the exact definitions will come in Section In 1955 P. Havel devel- 
oped a simple greedy algorithm to solve the degree sequence problem for simple 
undirected graphs (0). In 1960 P. Erdos and T. Gallai studied the /-factor 
problem for the case of a complete graph G, and proved a simpler Tutte-type 
result for the degree sequence problem (see [3j). As they already pointed out, 
the result can be derived directly form the original /-factor theorem, taking into 
consideration the special properties of the complete graph G, but their proof 
was independent of Tutte's proof and they referred to Havel's theorem. 
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In 1962 S.L. Hakimi studied the degree sequence problem in undirected 
graphs with multiple edges ([6]). He developed an Erdos-Gallai type result 
for this much simpler case, and for the case of simple graphs he rediscovered 
the greedy algorithm of Havel. Since then this algorithm is referred to as the 
Havel-Hakimi algorithm. 

For directed graphs the analogous question of recognizability of a bi-graphical- 
sequence comes naturally. In this case we are given two n-element vectors d"*", d~ 
of non-negative integers. The problem is the existence of a directed graph on 
n vertices, such that the first vector represents the out-degrees and the second 
one the in-degrees of the vertices in this graph. In 1957 D. Gale and H. J. 
Ryser independently solved this problem for simple directed graphs (there are 
no parallel edges, but loops are allowed), see [5l[T3]. In 1958 C. Berge general- 
ized these results for p-graphs where at most p parallel edges are allowed ([1]). 
(Berge calls the out-degree and in-degree together the demi-degrees.) Finally 
in 1973, the revised version of his book Graphs ([2]) gives a solution for the 
p-graph problem, loops excluded. To show some of the afterlife of these results: 
D. West in his renowned recent textbook ([TT]), discusses the case of simple 
directed graphs with loops allowed. 

The analog of /-factor problems for directed graphs has a sparser history. 
0ystein Ore started the systematic study of that question in 1956 (see [TTIIT^ ). 
His method is rather algebraic, and the finite and infinite cases - more or less - 
are discussed together. The first part developed the tools and proved the directly 
analog result of Tutte's /-factor problem for finite directed graphs (with loops), 
while the second part dealt with the infinite case. 

In 1962 L.R. Ford and D.R. Fulkerson studied, generalized and solved the 
"original" /-factor problem for a directed graph G ([4]). Here lower and upper 
bounds were given for both demi-degrees of the desired subgraph (no parallel 
edges, no loops) with the original question naturally corresponding to equal 
lower and upper bounds. The solutions (as well as in Berge's cases) are based 
on network fiow theory. 

Finally, in a later paper Hakimi also proves results for bi-graphical sequences, 
however, without presenting a directed version of his original greedy algorithm 
(see H). 

3 A greedy algorithm to realize bi-graphical se- 
quences 

A sequence d = {di, c?2, • ■ • , c?n} of nonnegative integers is called a graphical 
sequence if a simple graph G{V,E) exists on n nodes, V = {vi,V2, ■ ■ ■ ,Vn}, 
whose degree sequence is d. In this case we say that G realizes the sequence d. 
For simplicity of the notation we will consider only sequences of strictly positive 
integers (d„ > 0) to avoid isolated points. The following, well-known result, was 
proved independently by V. Havel and S.L. Hakimi. 
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Theorem 1 (Havel [8|, Hakimi [6]) There exists a simple graph with degree 
sequence di > 0, 1^2 > • • • > > (n > i) ij and only if there exists one 
with degree sequence ^2 — 1, . . . , dd^+i — 1, ^^1+2, • ■ • , d„. (Note that there is no 
prescribed ordering relation between di and the other degrees.) 

This can be proved using a recursive procedure, which transforms any reahzation 
of the degree sequence into the form described in the Theorem [U by a sequence 
of two-edge swaps. 

A bi- degree- sequence (or BDS ior short) (d+,d^) — ({dj'", dj, . . . , d^}, {d^^, (^2^, 
. . . , d^ }) of nonnegative integers is called a bi- graphical sequence if there exists a 
simple directed graph (digraph) G{V^ E) on n nodes, V = {ui, V2, ■ ■ ■ ,v„}, such 
that the out-degree and in-degree sequences together form (d+,d"). (That is 
the out-degree of vertex Vj is d'^ and its in-degree is dj .) In this case we say that 

G realizes our BDS. For simplicity, we will consider only sequences of strictly 
positive integer BDS's, that is each degree is > and -\- d~ > 0, to avoid 
isolated points. 

Our goal is to prove a Havel-Hakimi type algorithm to realize bi-graphical 
sequences. To that end we introduce the notion of normal order: we say that 
the BDS is in normal order if the entries satisfy the following properties: for 
each i = 1, . . . , n — 2 we either have d~ > d~'^^ or d~ = d~^-^ and > d^i- 
Clearly, all BDS-s can be arranged into normal order. Note that we made no 
ordering assumption about node Vn (the pair d+,d~). 

Theorem 2 Assume that the BDS (d+, d^) {with d'^ + dj > 0, j G [1, n]) is in 
normal order and d^ > (recall: the out-degree of the last vertex is positive). 
Then (d+, d^) is bi-graphical if and only if the BDS 

A+ _ / 4 */ kj^n , . 

" \ if k^n, ^ ' 

A - _ / '^fe - 1 k<d+ , . 

' ~ I rffe ^/ k > d+ , 

with zero elements removed ( those j for which = Aj — 0) is bi-graphical. 



Before starting the proof, we emphasize the similarity between this result and 
the original HH-algorithm. As in the undirected case, using Theorem[21 we can 
find in a greedy way a proper realization of graphical bi-degree sequences. 

Indeed: choose any vertex Vn with non-zero out-degree from the sequence, 
arrange the rest in normal order, then make d^ connections from Vn to nodes 
with largest in-degrees, thus constructing the out-ncighborhood of f„ in the (fi- 
nal) realization. Next, remove the vertices (if any) from the remaining sequence 
that have lost both their in- and out- degrees in the process, pick a node with 
non-zero out-degree, then arrange the rest in normal order. Applying Theo- 
rem 2 again, we find the final out-neighborhood of our second chosen vertex. 
Step by step we find this way the out-neighborhood of all vertices, while their 
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in- neighborhoods get defined eventually (being exhausted by incoming edges). 
Note, that every vertex in this process is picked at most once, namely, when its 
out-neighborhood is determined by the Theorem, and never again after that. 



Our forthcoming proof is not the simplest, however, we use a more general setup 
to shorten the proofs of later results. 

First, we define the partial order ^ among /c-element vectors of increasing 
positive integers: we say a ^ b iff for each j = 1, . . . ,k we have aj < bj. 

A possible out-neighborhood (or PON for short) of vertex w„ is a d^-element 
subset of \ {vn} which is a candidate for an out-neighborhood of Vn in some 
graphical representation. (In essence, a PON can be any d+-element subset of 
V\{vn} but later on we may consider some restrictions on it.) Let A be a PON 
of Vn- Then denote by i{A) the vector of the increasingly ordered subscripts of 
the elements of A. (For example, ii A = {v2,V4,vg}, then i{A) — (2,4,9).) Let 
A and B be two PONs of w„. We write: 

B^A <^ isdiiA- (3) 

In this case we also say that is to f/ie Ze/t of A. (For example, B = {^i, W2, wg, wy} 
is to the left of A = {v2, va, vq, vg}.) 

Definition 3 Consider a bi-graphical BDS sequence (d+,d^) and let A be a 
PON ofvn- The ^-reduced BDS (d+|^,d-|^) is defined as: 

^+1 _ / 4 if k^n , . 

fel^ \ if k^n, ^ ' 

I _ f dfc -1 */ k&i{A) 
^ \a \ dl if k^i{A). 

In other words, if A is a PON in a BDS, then the reduced degree sequence 
(d+|^,d~|^) is obtained by removing the out-edges of node Vn (according to 
the possible out- neighborhood A). As usual, if for one subscript k in the A- 
reduced BDS we have | ^ ~ '^fe I a ~ ^ then the vertex with this index is to be 
removed from the bi-degree sequence. 

Lemma 4 Let (d+,d~) be a BDS, and let A be a possible out-neighborhood of 
Vn. Furthermore let B be another PON with B = A \ {vk} U {vi} where d^ > djT 
and in case of = d'j^ we have df > d^. Then i/(D+,D^) := (d+|^,d^|^) 
is bi-graphical, so is (d+|^,d^|^). 

Proof. Since our A- reduced BDS (D+, D~) is bi-graphical, there exists a directed 
graph G which realizes the bi-degree sequence (D+,D~). We are going to 
show that in this case there exists a directed graph G' which realizes the BDS 
(d+|^,d^|^). In the following, VaVb will always mean a directed edge from 

node Va to node Vb. Let us now construct the directed graph Gi by adding 
VnV directed edges for each v € A. (Since according to (H)), in (D+,D~) the 
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out-degree of Vn is equal to zero, no parallel edges are created.) The bi-degree- 
sequence of Gi is (d"*",!!"). Our goal is to construct another realization G[ of 
(d+ , d~ ) such that the deletion of the out-edges of Vn in the latter produces the 
BDS(d+|^,d-|J. 

By definition we have VnVk G Ei, (the edge set of Gi) but VnVi ^ Ei. At 
first assume that there exists a vertex vg {£ ^ i,k,n), such that viVi G Ei 
but viVk ^ El. (When > d^T then this happens automatically, however if 

— and VkVi G Ei then it is possible that the in-neighborhood of Vi and Vk 
are the same - except of course Vk, Vi themselves and w„.) This means that now 
we can swap the edges VnVk and V£Vi into VnVi and viVk- (Formally we create 
the new graph G[ = {¥,£[) such that E[ ^ Ei\ {vnVk,vm} U {vnVi,veVk}.) 
This achieves our wanted realization. 

Our second case is when d~ — d'j^ , VkVi d Ei, and furthermore 

for each i =i i,k,n we have vgVi Cz Ei veVk G Ei. (6) 

It is important to observe that in this case ViVk ^ Ei : otherwise some vi would 
not satisfy (O (in order to keep d^ = d^)- 

Now, if there exists a subscript m (different from k, i, n) such that ViVm G Ei 
but VkVm ^ El, then we create the required new graph G'l by applying the 
following triple swap (or three-edge swap): we exchange the directed edges 
VnVk,VkVi and v^Vm into VnVi^v^Vk and VkVm- 

By our assumption we have df > d'^ . On one hand side if dj' > d'^ holds 

then due to the properties VkVi G E and ViVk ^ -E, there exist at least two 
subscripts mi, 7712 ^ i,k such that ViVmj G E but VkVmj ^ E and at least one 
of them differs from n. Thus, when df > d'^ , we do find such an m for which 
the triple swap above can be performed. 

The final case is when d^ = d^ and df = d'^ . If vertex Vm does not exist, 
then we must have ViVn G Ei (to keep d'^ — d'^), and in this case clearly, 
VkVn ^ El. Therefore, in this (final) case the graphical realization Gi has the 
properties VnVk,VkVi,ViVn G Ei and VnVi,ViVk,VkVn Ei. Then the triple swap 

E'l := El \ {VnVk,VkVi,ViV„} U {VnVi,ViVk,VkVn} (7) 

will produce the required new graphical realization G'^. □ 

Observation 5 For later reference it is important to recognize that in all cases 
above, the transformations from one realization to the next one happened with 
the use of two-edge or three-edge swaps. 

Lemma 6 Let (d+,d^) be an BDS and let A and C be two possible out- 
neighborhoods of Vn- Furthermore assume that C < A, that is C is to the left 
of A. Finally assume that vertices in A U C are in normal order. Then if 
(d+|^,d~|^) is bi-graphical, so is (d+|p,d~|p). 
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Proof. Since C is to the left of A therefore, there is a (unique) bijection (f) : 
C\A^A\C such that Vc S C \ A : i{{c]) < i{{(t){c)}) (the subscript of 
vertex c is smaller than the subscript of vertex 4i{c)). (For example, if A = 
{v4,V5,ve,V7,vs,vg} and C = {ui, U2, W3, ^5, ws}, then C\A = {wi,U2,W3}, 
A \ C = {u4, vq,vq}, and is the map {vi o U4, <r^ ve, W3 wg}). 

To prove Lemma [5] we apply Lemma 0] recursively for each c £ C \ A (in 
arbitrary order) to exchange (/<(c) G A with c e C, preserving the graphical 
character at every step. After the last step we find that the sequence reduced 
by C is graphical. □ 

Proof of Theorem [2j We can easily achieve now the required graphical realiza- 
tion of (d+, d~) if we use Lemma [5] with the current A, and C = {vi, . . . , Vj^+}. 
We can do that since (d+, d~) is in normal order, therefore the assumptions of 
Lemma [6] always hold. □ 

4 A simple prerequisite for MCMC algorithms 
to sample directed graphs with given BDS 

In practice it is often useful to choose uniformly a random element from a set 
of objects. A frequently used tool for that task is a well-chosen Markov-Chain 
Monte-Carlo method (MCMC for short). To that end, a graph is established 
on the objects and random walks are generated on it. The edges represent 
operations which can transfer one object to the other. If the Markov chain can 
step from an object x to object y with non-zero probability, then it must be 
able to jump to x from y with non-zero probability (reversibility). If the graph 
is connected, then applying the well-known Metropolis-Hastings algorithm, it 
will yield a random walk converging to the uniform distribution starting from 
an arbitrary (even fixed) object. 

To be able to apply this technique we have to define our graph (the Markov 
chain) 5(d+, d^) — (V, £). The vertices are the different possible realizations of 
the bi-graphical sequence (d+, d~). An edge represents an operation consisting 
of a two or three-edge swap which transforms the first realization into the second 
one. (For simplicity, sometimes we just say swap for any of them.) We will show: 

Theorem 7 Let Gi , G2 be two realizations of the same bi-graphical sequence 
(d+,d^). Then there exists a sequence of swaps which transforms Gi into G2 
through different realizations of the same bi-graphical sequence. 

Remark: In the case of undirected graphs the (original) analogous obser- 
vation (needing only two-edges swaps) was proved by H.J. Ryser ([IB]). 

Proof. We prove the following stronger statement: 

{^) there exists a sequence of at most 2e swaps which transform Gi into G2, 
where e is the total number of out-edges in (d"'",d~) 
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by induction on e. Assume that (^) holds for e' < e. We can assume that 
our bi-graphical sequence is in normal order on the first n — 1 vertices and 
d+ > 0. By Theorem [5] there is a sequence Ti (T2) oi d = many swaps 
which transforms Gi (G2) into a G[ (G'2) such that r+ = {vi,...,Vd} 

We consider now the directed graphs G" (Gj) derived from directed graph 
G'l (directed graph Gj) by deleting all out-neighbors of Vn- Then both directed 
graphs realize the bi-graphical sequence (A"'",A~) which, in turn, satisfies re- 
lations (H]) and ([2]). Therefore the total number of out-degrees is e — d in both 
directed graphs, and by the inductive assumption there is a sequence T of 2(e— d) 
many swaps which transforms G" into Gj- 

Now observe that if a swap transforms H into iJ', then the "inverse swap" 
(choosing the same edges and non-edges and swap them) transforms H' into H. 
So the swap sequence T2 has an inverse T-^ which transforms G2 into G2 . Hence 
the sequence T1TT2 is the required swap sequence: it transforms Gi into G2 
and its length is at most d + 2(e — d) + d = 2e. □ 

5 Is a BDS bi-graphical when one of its vertex's 
out-neighborhood is constrained? 

In network modeling of complex systems (for a rather general reference see [lOj ) 
one usually defines a (di)graph with components of the system being represented 
by the nodes, and the interactions (usually directed) amongst the components 
being represented as the edges of this digraph. Typical cases include biological 
networks, such as the metabolic network, signal transduction networks, gene 
transcription networks, etc. The graph is usually inferred from empirical ob- 
servations of the system and it is uniquely determined if one can specify all 
the connections in the graph. Frequently, however, the data available from the 
system is incomplete, and one cannot uniquely determine this graph. In this 
case there will be a set V of (di)graphs satisfying the existing data, and one can 
be faced with: 

(i) finding a typical element of the class V, 

(ii) or generating all elements of the class V. 

(A more complete analysis of this phenomenon can be found in [9j.) In Section^] 
we already touched upon problem (i) when V is the class of all directed graphs 
of a given BDS. The analogous Problem (ii) for undirected graphs was recently 
addressed in |5] which provides an economical way of constructing all elements 
from v. In this Section we give a prescription based on the method from [S], 
to solve (ii) for the case of all directed graphs with prescribed BDS. This is 
particularly useful from the point of view of studying the abundance of motifs 
in real-world networks: one needs to know first all the (small) subgraphs, or 
motifs, before we study their statistics from the data. 
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Before we give the details, it is perhaps worth making the following remark: 
Clearly, one way to solve problem (i) would be to first solve problem (ii), then 
choose uniformly from T). However, in (those very small) cases when reasonable 
answers can be expected for problem (ii), problem (i) is rather uninteresting. 
In general, however, (i) cannot be solved efficiently by the use of (ii). 

We start the discussion of problem (ii) with pointing out that our new, 
directed Havel-Hakimi type algorithm is unable to generate all realization of a 
prescribed DBS (see Figure [TJ. 

(2,1) (2,1) 




(2,1) (2,1) 

Figure 1: This graph cannot be obtained by the directed Havel-Hakimi procedure. 
The integers indicate node degrees. 

The situation is very similar to the non-directed case, see [9]. The directed HH- 
algorithm must start with a vertex with degree-pair (2,1), therefore the two 
vertices of degree-pair (0, 3) must be out-neighbors of the same vertex - not for 
the graph in the Figure. 

One possible way to overstep this shortage is to discover systematically all pos- 
sible out-connections from a given vertex v in all realizations of the prescribed 
graphical BDS. 

We do not know a greedy algorithm to achieve this. The next best thing 
we can do is to develop a greedy algorithm to decide whether a given (sub)set 
of prescribed out-neighbors of v would prevent to find a realization of the BDS 
containing those prescribed out-neighbors. In the following, we describe such a 
greedy algorithm. (It is perhaps interesting to note that this latter problem can 
be considered as a very special directed /-factor problem.) 

To start, we consider a (d+,d^) bi-degree sequence together with a for- 
bidden vertex set F whose elements are not allowed to be out-neighbors of 
vertex Vn. (Or, just oppositely, we can imagine that we already have decided 
that those vertices will become out-neighbors of Vn and the BDS is already 
updated accordingly. The forbidden vertex set governs only the out-neighbors, 
since in the process the in- neighbors are born "automatically".) It is clear that 
|i^| -I- 1 + < n must hold for the existence of a graphical realization of this 
F-restricted BDS. 

Assume that the vertices are enumerated in such a way that subset F con- 
sists of vertices w„_|/^-| , . . . , w„_i and vertices V' = {wi, . . . , w„_|i?|_i} are in 
normal order. (We can also say that we apply a permutation on the subscripts 
accordingly.) Then we say that the BDS is in F-normal order. 

Definition 8 Consider a bi-graphical BDS sequence (d+,d^) in F-normal or- 
der, and let A be a PON. The A-reduced BDS (d+|^,d^|^) is defined as in 
and while keeping in mind the existence of an F set to the right of A. 
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In other words, if ^ is a PON in an _F-restricted BDS, then the reduced degree 
sequence (d+|^,d~|^) is still obtained by removing the out-edges of node w„ 
(according to the possible out- neighborhood A). 

Finally, one more notation: let (d+,d~) be a BDS, F a forbidden vertex 
subset of V and denote by F[k] the set of the first k vertices in the F- normal 
order. 

Theorem 9 Let A be any PON in the F -restricted (d+,d^) BDS, which is in 
F-normal order. Then if the A-reduced BDS (d+|^,d^|^) is graphical, then 

the F[d^]-reduced BDS ^d+ , d^ is graphical as well. 

Proof. It is immediate: Lemma [S] applies. □ 

This statement gives us indeed a greedy way to check whether there exists a 
graphical realization of the F-restricted bi-degree sequence (d+ , d~ ) : all we have 

to do is to check only whether the F[d+]-reduced BDS ^d+ , d~ is 

graphical. 

Finally, we want to remark that, similarly to the indirected case. Theorem [S] 
is suitable to speed up the generation of all possible graphical realizations of a 
BDS. The details can be found in [9] which is a joint work of these authors with 
Hyunju Kim and Laszlo A. Szekely. 
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